Developments in continuous speech dictation using the ARPA WSJ task

نویسندگان

  • Jean-Luc Gauvain
  • Lori Lamel
  • Martine Adda-Decker
چکیده

In this paper we report on our recent development work in large vocabulary,American English continuous speech dictation. We have experimented with (1) alternative analyses for the acoustic front end, (2) the use of an enlarged vocabulary so as to reduce the number of errors due to out-of-vocabulary words, (3) extensions to the lexical representation, (4) the use of additional acoustic training data, and (5) modification of the acoustic models for telephone speech. The recognizer was evaluated on Hubs 1 and 2 of the fall 1994 ARPA NAB CSR Hub and Spoke Benchmark test. Experimental results for development and evaluation test data are given, as well as an analysis of the errors on the development data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The use of confidence measures in unsupervised adaptation of speech recognizers

Confidence estimation of the output hypothesis of a speech recognizer offers a way to assess the probability that the recognized words are correct. This work investigates the application of confidence scores for selection of speech segments in unsupervised speaker adaptation. Our approach is motivated by initial experiments that show that the use of mis-labeled data has a significant cost in th...

متن کامل

Developments in Large Vocabulary Dictation : The LIMSI Nov 94 NAB System yJ

In this paper we report on our development work in large vocabulary , American English continuous speech dictation on the ARPA NAB task in preparation for the November 1994 evaluation. We have experimented with (1) alternative analyses for the acoustic front end, (2) the use of an enlarged vocabulary of 65k words so as to reduce the number of errors due to out-of-vocabulary words, (3) extension...

متن کامل

Developments in continuous speech dictation using the 1995 ARPA NAB news task

In this paper we report on the LIMSI recognizer evaluated in the ARPA 1995 North American Business (NAB) News benchmark test. In contrast to previous evaluations, the new Hub 3 test aims at improving basic SI, CSR performance on unlimitedvocabulary read speech recorded under more varied acoustical conditions (background environmental noise and unknown microphones). The LIMSI recognizer is an HM...

متن کامل

The LIMSI continuous speech dictation system: evaluation on the ARPA Wall Street Journal task

In this paper we report progress made at LIMSI in speakerindependent large vocabulary speech dictation using the ARPA Wall Street Journal-based CSR corpus. The recognizer makes use of continuous density HMM with Gaussian mixture for acoustic modeling and n-gram statistics estimated on the newspaper texts for language modeling. The recognizer uses a time-synchronous graph-search strategy which i...

متن کامل

Automatic Speech Recognition and its Application to Information Extraction

This paper describes recent progress and the author's perspectives of speech recognition technology. Applications of speech recognition technology can be classified into two main areas, dictation and human-computer dialogue systems. In the dictation domain, the automatic broadcast news transcription is now actively investigated, especially under the DARPA project. The broadcast news dictation t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995